Latent Contextual Bandits: A Non-Negative Matrix Factorization Approach
Authors
Abstract
We consider the stochastic contextual bandit problem with a large number of observed contexts and arms, but with a latent low-dimensional structure across contexts. This low-dimensional (latent) structure encodes the fact that both the observed contexts and the mean rewards from the arms are convex mixtures of a small number of underlying latent contexts. At each time step, we are presented with an observed context; the bandit problem is to determine the corresponding arm to pull in order to minimize regret. Assuming a separable and low-rank latent context vs. mean-reward matrix, we employ non-negative matrix factorization (NMF) techniques on sub-sampled estimates of matrix entries (estimates constructed from careful arm sampling) to efficiently discover the underlying factors. This estimation lies at the core of our proposed ε-greedy NMF-Bandit algorithm, which switches between arm exploration to reconstruct the reward matrix and exploitation of arms using the reconstructed matrix in order to minimize regret. We identify singular value conditions on the non-negative factors under which the NMF-Bandit algorithm has O(L poly(m, log K) log T) regret, where L is the number of observed contexts, K is the number of arms, and m is the number of latent contexts. We further propose a class of generative models that satisfy our sufficient conditions, and derive a lower bound that matches our achievable bounds up to a poly(m, log K) factor. Finally, we validate the NMF-Bandit algorithm on synthetic datasets.
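The explore/exploit loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' NMF-Bandit algorithm: it assumes Bernoulli rewards, uniform random exploration in place of the paper's careful arm sampling, a generic multiplicative-update NMF in place of a separable NMF method, and an arbitrary ε schedule. All function names and parameters (`nmf`, `make_instance`, `run_bandit`, `refit_every`) are hypothetical.

```python
import numpy as np

def nmf(V, rank, iters=200, seed=0):
    """Approximate V by W @ H with non-negative factors.

    Uses generic multiplicative updates (an assumption, not the
    separable-NMF technique the abstract refers to).
    """
    rng = np.random.default_rng(seed)
    W = rng.random((V.shape[0], rank)) + 1e-3
    H = rng.random((rank, V.shape[1])) + 1e-3
    for _ in range(iters):
        H *= (W.T @ V) / (W.T @ W @ H + 1e-9)
        W *= (V @ H.T) / (W @ H @ H.T + 1e-9)
    return W, H

def make_instance(L, K, m, seed=0):
    """Synthetic L x K mean-reward matrix: convex mixtures of m latent rows."""
    rng = np.random.default_rng(seed)
    latent = rng.random((m, K))              # latent context vs. mean reward
    mix = rng.dirichlet(np.ones(m), size=L)  # each row sums to 1
    return np.clip(mix @ latent, 0.0, 1.0)

def run_bandit(mean_rewards, m, T=5000, refit_every=200, seed=0):
    """Epsilon-greedy loop: explore to sample entries, exploit the NMF estimate."""
    rng = np.random.default_rng(seed)
    L, K = mean_rewards.shape
    sums = np.zeros((L, K))
    counts = np.zeros((L, K))
    estimate = np.full((L, K), 0.5)          # uninformed starting estimate
    regret = 0.0
    for t in range(1, T + 1):
        context = rng.integers(L)            # observed context arrives
        eps = min(1.0, 50.0 / t)             # hypothetical decaying schedule
        if rng.random() < eps:
            # Explore: pull a uniformly random arm and record the reward.
            arm = int(rng.integers(K))
            reward = rng.binomial(1, mean_rewards[context, arm])
            sums[context, arm] += reward
            counts[context, arm] += 1
        else:
            # Exploit: pull the best arm under the reconstructed matrix.
            arm = int(np.argmax(estimate[context]))
        regret += mean_rewards[context].max() - mean_rewards[context, arm]
        if t % refit_every == 0:
            # Empirical means where sampled; previous estimate elsewhere.
            empirical = np.where(counts > 0,
                                 sums / np.maximum(counts, 1),
                                 estimate)
            W, H = nmf(empirical, m)
            estimate = W @ H
    return regret, estimate

means = make_instance(L=20, K=10, m=3)
regret, estimate = run_bandit(means, m=3)
print(f"cumulative regret over 5000 rounds: {regret:.1f}")
```

Only exploration rounds feed the matrix estimate here, which mirrors the abstract's description of estimates built from sampled entries; the regret guarantees stated in the abstract do not apply to this simplified sketch.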
Similar resources
Contextual Bandits with Latent Confounders: An NMF Approach
Motivated by online recommendation and advertising systems, we consider a causal model for stochastic contextual bandits with a latent low-dimensional confounder. In our model, there are L observed contexts and K arms of the bandit. The observed context influences the reward obtained through a latent confounder variable with cardinality m (m ≪ L, K). The arm choice and the latent confounder cau...
A new approach for building recommender system using non negative matrix factorization method
Nonnegative Matrix Factorization is a new approach to reduce data dimensions. In this method, by applying the nonnegativity of the matrix data, the matrix is decomposed into components that are more interrelated and divide the data into sections where the data in these sections have a specific relationship. In this paper, we use the nonnegative matrix factorization to decompose the user ratin...
Iterative Weighted Non-smooth Non-negative Matrix Factorization for Face Recognition
Non-negative Matrix Factorization (NMF) is a part-based image representation method. It comes from the intuitive idea that entire face image can be constructed by combining several parts. In this paper, we propose a framework for face recognition by finding localized, part-based representations, denoted “Iterative weighted non-smooth non-negative matrix factorization” (IWNS-NMF). A new cost fun...
Latent Contextual Bandits and their Application to Personalized Recommendations for New Users
Personalized recommendations for new users, also known as the cold-start problem, can be formulated as a contextual bandit problem. Existing contextual bandit algorithms generally rely on features alone to capture user variability. Such methods are inefficient in learning new users’ interests. In this paper we propose Latent Contextual Bandits. We consider both the benefit of leveraging a set o...
HINMF: A Matrix Factorization Method for Clustering in Heterogeneous Information Networks
Non-negative matrix factorization (NMF) has become quite popular recently on the relational data due to its several nice properties and connection to probabilistic latent semantic analysis (PLSA). However, few algorithms take this route for the heterogeneous networks. In this paper we propose a novel clustering method for heterogeneous information networks by searching for a factorization that ...
Journal title:
- CoRR
Volume abs/1606.00119 Issue
Pages -
Publication date 2016